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DESCRIPTION 

DIDEOXY DYE TERMINATORS 

FIELD OF THE INVENTION 
This invention relates to dye terminator nucleic 
acid sequencing and reagents for such sequencing. 

\3 BACKGROUND OF THE INVENTION 

■ k 5 The following is a discussion of the relevant art, 

S none of which is admitted to be prior art to the 

hi appended claims . 

Sequence reaction products must be labeled. This 
can be done using labeled primers, labeled nucleotides 
f Hi o (usually radioactive dNTPs) or labeled ddNTP 
4 L terminators. The use of labeled terminators has the 
Q advantage of leaving false-stops undetectable. 

DNA sequence bands do not necessarily have uniform 
intensities. It is useful to express band intensity 
15 variability numerically. This can be done by reporting 
the ratio of maximum to minimum intensity of nearby 
bands (within a window of perhaps 40 bases) in a DNA 
sequence or, with normalization and correction for 
systematic "drift" in intensity by reporting the root 
2 0 mean square of band intensities (typically peak 

heights) (Fuller, C.W., Comments 16(3) :l-8, 1989). It is 
advantageous to have uniformity of band intensity as 
sequence accuracy and read- length is improved with bands 
of more uniform intensity. 
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For accurate reading, the mobility of any given 
sequencing reaction product must migrate through the 
electrophoresis gel with a speed proportional only to 
its length. Products which migrate faster or slower 
5 than normal for a given length will result in sequence 
ambiguities or errors known as "compressions". 

Anomalous migration speed can be caused by 
secondary structure of the DNA and is apparently the 
cause of most "compression" artifacts seen in 

10 radioactive- label (and other) sequencing experiments at 
GC-rich regions. These can often be resolved by the use 
of analogs of dGTP such as 7-deaza-dGTP or dITP. 
Another compression- like artifact is observed when some 
dye -labeled ddNTPs are used for sequencing. Several 

15 examples of this can be seen in Lee, L.G., Connell, 
C.R., Woo, S.L., Cheng, R.D., McArdle, B.F., Fuller, 
C.W., Halloran, N.C., and Wilson. R.K., Nucleic Acids 
Res., 20:2471-2483, 1992 (see figures 4g, 4h and 6h 
using ddCTP labeled with tetramethylbodipy and TMR or 

2 0 ddGTP labeled with bifluor) . These compression- like 

artifacts are produced, even in sequences which are 
compression- free when sequenced radioactively or with 
dye-labeled primers. These artifacts can sometimes be 
eliminated by substituting dITP for dGTP or alpha-thio 
25 dNTPs for normal dNTPs (Lee, L.G. et al., Nucleic Acids 
Res., 20: 2471-2483, 1992; U.S. Patent No. 5,187,085). 
Similar artifacts seen with the fluorescein dye-labeled 
ddNTPs sold by Applied Biosystems for dye-terminator 
sequencing with T7 DNA polymerase are resolved by 

3 0 substituting alpha-thio dNTPs for normal dNTPs (Lee, 

SSSD/43876. V01 
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L.G. et al., Nucleic Acids Res . , 20: 2471-2483, 1992; 
U.S. Patent No. 5,187,085). 

Prober, J.M., Trainor, G.L., Dam, R.J., Hobbs, 
F.W., Robertson, C.W., Zagursky, R.J., Cocuzza, A.J,, 
5 Jensen, NLA. and Baumeister, K. , Science 238:336-41 
(1987) performed sequencing using terminators labeled 
with substituted succinyl- fluoresceins with linkers of 
10 atoms in length, together with dATP, dCTP, dTTP, 7- 
deaza-dGTP and AMV reverse transcriptase, and a 

10 fluorescence -detecting instrument. From Fig. 6 of this 
paper is clear that overall band intensities varied by 
more than 10 -fold, far more than the best available 
current methods with dye primers or radioactive labels. 
Dideoxy NTP terminators that have the same basic 

15 structure as the Prober et al. (1987) terminators, but 
have four rhodamine dyes used in place of the succinyl 
fluoresceins and linkers of 5 atoms in length, have been 
used for sequencing with Taq polymerase. In order to 
use these terminators, dITP is used in place of dGTP or 

2 0 7-deaza-dGTP to eliminate severe "compression" 

artifacts. This method has been practiced using cloned 
Taq DNA polymerase (Bergot, WO 9105060; Parker, L.T., 
Deng, Q, Zakeri, H., Carlson, C. Nickerson, D.A., Kwok, 
P.Y., Biotechniques 19 (1) : 116-121, 1995) and with a 

25 mutant of Taq polymerase (D49G, AmpliTaq CS) lacking 5'- 
3 1 exonuclease activity. However, band intensities vary 
by as much as 20 -fold, limiting the accuracy and read- 
length possible with the method (Parker, L.T., Zakeri, 
H., Deng, Q. , Spurgeon, S., Kwok, P.Y., Nickerson, D.A., 

30 Bio techniques 21 (4) : 694-699, 1996). 

SSSD/43876. vOl 
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Lee, L.G., Connell, C.R., Woo, S.L., Cheng, R.D., 
McArdle, B.F., Fuller, C.W., Hallorand, N.D. and Wilson, 
R.K., Nucleic Acids Res . , 20 :2471, 1992 ) describe 
sequencing with a set of ddNTP terminators and T7 DNA 
5 polymerase. All have fluorescein- type dyes attached to 
the ddNTPs in essentially the same manner as the 
rhodamine terminators used for Taq sequencing. These 
are used with modified T7 DNA polymerase {Sequenase™ 
version 2.0) and alpha-thio dNTPs . The thio dNTPs are 

10 used to resolve the "compression" artifacts like dITP is 
used for the Taq dye -terminator methods. The results 
with this system are such that bands vary in intensity 
about 10 -fold. 

Wayne Barnes has published a protocol for dye- 

15 terminator sequencing with FY modified polymerases and 
Mn 2+ (Scientech Corp. St. Louis, MO) . Bands are more 
uniform with this method varying about 4.5-fold at most. 

Fluorescein- 12 ddNTPs that have a linker length of 
12 atoms and Biotin-11 ddNTPs that have a linker length 

2 0 of 11 atoms are available (Dupont NEN, Wilmington, DE) . 
These labeled ddNTPs are described as useful in 
sequencing reactions. 

ABI PRISM disclose dichlororhodamine dyes linked to 
terminators by propargyl/ ethylene oxide/amino ( W E0") 

25 linkers eight atoms in length for sequencing (Rosenblum, 
B.B., Lee, L.G., Spurgeon, S.L., Khan, S.H., Menchen, 
S.M., Heiner, C.R., and Chen, S.M., Nucleic Acids Res. 
25 (22) :4500-4504, 1997). 

SSSD/43876. vOl 
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Cyanine dyes have been utilized in dye terminators 
for sequencing (Lee et al., Nucleic Acids Res., 
20 (10) :2471, 1992) . 

SUMMARY OF THE INVENTION 
The present invention provides novel dideoxy dye- 
labeled terminators which are useful in a number of 
biological processes, including providing uniform band 
intensities and the resolution of dye- induced 
compression artifacts in DNA sequencing. The dideoxy 
dye- labeled terminators of the present invention are 
particularly well suited for use with DNA polymerases 
that are thermostable or which contain an altered dNMP 
binding site (Tabor et al., U.S. Patent No. 5,614,365). 
Use of the terminators of the present invention for 
sequencing does not require the use of nucleotide 
analogs such as dITP or alpha-thio nucleotides to 
eliminate dye-induced compression artifacts. Applicant 
has surprisingly found that there is a strong 
correlation between the length of the link between the 
dye molecule and the nucleotide and band uniformity, but 
little correlation between the type of dye (or other 
parameters) and band uniformity. Dye terminators with 
linkers of 10 or more atoms (extended linkers) up to 25 

atoms (10, 11, 12 25) when used in sequencing 

reactions produce bands in sequencing gels of 
significantly improved uniformity compared with dye 
terminators with linkers less than 10 atoms. 

The dye termininators of the present invention with 
extended linkers typically are provided in groups of 

SSSD/43876. vOl 
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four (ATGC) with or without a thermostable DNA 
polymerase and are especially useful in a method of 
sequence analysis. 

In a first aspect, the invention features a kit for 
5 DNA sequencing having a first, second, third and fourth 
dye terminator molecule, each of the dye terminator 
molecules has a dye molecule, a linker of at least 10 
atoms in length and either ddATP, ddCTP, ddGTP or ddTTP 
as a mono or tri -phosphate and a thermostable DNA 

1 0 polymerase . 

By "dye molecule" is meant any molecule that has a 
detectable emission spectrum, including but not limited 
to fluorescein, rhodamine, texas red, eosin, lissamine, 
coumarin, cyanine, and derivatives of these molecules. 

15 Dyes also include energy transfer dyes each comprising a 
donor and an acceptor dye. 

By "linker" is meant a chain of at least 10 atoms 
comprising carbon, nitrogen, and oxygen which links the 
dye molecule with the dideoxynucleotide. The chain may 

20 also contain substituted carbon or sulfur. Linkage 
typically occurs at the aromatic base moiety of the 
nucleotide. The first two atoms of the linker attached 
to the base are typically joined in a triple bond. 

By "substituted carbon " is meant that one or more 

25 hydrogens are replaced with a substitute group such as, 
but not limited to, hydroxyl, cyano, alkoxy, oxygen, 
sulfur, nitroxy, halogen, -N(CH 3 ) 2 , amino, and -SH. 

By "thermostable DNA polymerase" is meant a DNA 
polymerase has a half-life of greater than 5 minutes at 

3 0 90 °C. Such polymerases include, but are not limited to, 

SSSD/43876. vOl 
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DNA polymerases encoded by Thermus aqaaticus, Thermus 
thermophilics, Thermus flavus, Thermococcus littoralis, 
Pyrococcus furiosus, Thermotoga maritima, and Thermotoga 
neapolitana. 

5 In a preferred embodiment the thermostable DNA 

polymerase has an altered dNMP binding site so as to 
improve the incorporation of dideoxynucleotides relative 
to the natural polymerase. A DNA polymerase with an 
altered dNMP binding site does not discriminate 

10 significantly between dideoxynucleotides and 

deoxynucleotides . The chance of incorporating a 
dideoxynucleotide is approximately the same as that of a 
deoxynucleotide or at least 1/10 the efficiency of a 
deoxynucleotide . 

15 In a second aspect the invention features a 

compound of formula (I) 

A 

B 
C 

A is a cyanine dye of the structure 




wherein the curved lines represent carbon atoms 
necessary for the formulation of cyanine dyes; X and Y 
20 are selected from the group consisting of 0, S, and 

SSSD/43876. vOl 
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CH 3 -C-CH 3 ; m is an integer selected from the group 
consisting of 1, 2, 3, and 4; Rl, R2, R3 , R4, R5, R6, 
and R7 are independently selected from the group 
consisting of H, OH, C0 2 H, sulfonic acid or sulfonate 
5 groups, esters, amides, ethers, alkyl or aryl groups, 
and B and one Rl, R2, R3, R4, R5, R6 or R7 is B. 

B is a linker of at least 10 atoms in length 
wherein the atoms are selected from the group consisting 
of carbon, nitrogen, oxygen, substituted carbon and 
10 sulfur and the linker is attached at one end to A and at 
the other end to C. 

C is a dideoxynucleotide selected from the group 
consisting of: 




R-0 




R — 0 




and 



and wherein the linker is covalently bonded to the 
15 dideoxynucleotide at position 7 for the purines (ddG, 
ddA) and at position 5 for the pyrimidines (ddT, ddC) 
and wherein r is a mono or tri -phosphate. 

The term "sulfonic acid or sulfonate groups'' refer 
to S0 3 H groups or salts thereof. 
20 The term "ester" refers to a chemical moiety with 

formula -(R)n-COOR', where R and R' are independently 
selected from the group consisting of saturated or 
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unsaturated alkyl and f ive-membered or six-membered aryl 
or heteroaryl moieties and where n is 0 or 1. 

The term "amide" refers to a chemical substituent 
of formula -NHCOR, where R is selected from the group 
5 consisting of hydrogen, alkyl, hydroxyl, and f ive- 
membered or six-membered aryl or heteroaryl ring 
moieties, where the ring is optionally substituted with 
one or more substituents independently selected from the 
group consisting of alkyl, halogen, trihalomethyl, 

10 carboxylate, nitro, or ester. 

The term u ether" refers to a chemical moiety with 
formula R-O-R' where R and R' are independently selected 
from the group consisting of saturated or unsaturated 
alkyl and f ive-membered or six-membered aryl or 

15 heteroaryl moieties and where n is 0 or 1. 

The term "alkyl" refers to a straight -chain or 
branched aliphatic hydrocarbon. The alkyl group is 
preferably 1 to 10 carbons, more preferably a lower 
alkyl of from 1 to 7 carbons, and most preferably 1 to 4 

2 0 carbons. Typical alkyl groups include methyl, ethyl, 
propyl, isopropyl, butyl, isobutyl, tertiary butyl, 
pentyl, hexyl and the like. The alkyl group may be 
substituted and some typical alkyl substituents include 
hydroxyl, cyano, alkoxy, oxygen, sulfur, nitroxy, 

25 halogen, -N(CH 3 ) 2 , amino, and -SH. 

The term "aryl" refers to an aromatic group which 
has at least one ring having a conjugated pi electron 
system and includes both carbocyclic aryl (e.g. phenyl) 
and heterocyclic aryl groups (e.g. pyridine). The term 

30 "carbocyclic" refers to a compound which contains one or 

SSSD/43876. vOl 
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more covalently closed ring structures, and that the 
atoms forming the backbone of the ring are all carbon 
atoms. The term thus distinguishes carbocyclic from 
heterocyclic rings in which the ring backbone contains 
5 at least one atom which is different from carbon. The 
term "heteroarly" refers to an aryl group which contains 
at least one heterocyclic ring. 

In a preferred embodiment the linker is selected 
from the group consisting of: 

10 -C=C-CH 2 -NH-CO- (CH 2 ) 5 -NH-CO-, 
-C=C-CH 2 -NH-CO- (CH 2 ) 9 -NH-S0 3 - , 
-ChC-CH 3 -NH-CO- (CH 3 ) 10 -NH-CO-, 
-C=C-CH 2 -NH-C0- (CH 2 ) 5 - , 

-ChC-CH 2 -NH-C0- (CH 2 ) 5 -NH-CO- (CH a ) 5 -, and 
15 -C=C-CH 3 -NH-C0- (CH 2 ) 5 -NH-C0- (CH 3 ) 10 -NH-CO- 

In preferred embodiments the dideoxy dye 
terminators are; a compound of the formula (II) : 




11 

;a compound of the formula (III) 
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; a compound of the formula (IV) : 
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/compound of the formula (V) : 
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The Cy-5.5 ddGTP and ddCTP compounds have a linker 
5 of 10 atoms in length. The Cy-5.5 ddCTP and ddTTP 
compounds have a linker of 17 atoms in length. 

In a third aspect the invention features a 
deoxyribonucleic acid sequence containing the compound 
of formula I, II, III ,IV or V. 
10 In a preferred embodiment the invention features a 

kit for DNA sequencing comprising compounds of formula 
II, III, IV, and V. 
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In a further preferred embodiments the kit further 
has a thermostable DNA polymerase; the thermostable DNA 
polymerase has an altered dNMP binding site so as to 
improve the incorporation of dideoxynucleotides relative 
5 to the natural polymerase. 

Applicant has surprisingly found that the one 
parameter that most strongly correlates with band 
uniformity is the length of the linker between the dye 
and the ddNTP. Applicant has found that by extending ' 

10 the linker length between the dye and the nucleotide for 
any dye:ddNTP combination to at least 10 atoms, that 
band uniformity is substantially improved and there are 
no dye-induced compression artifacts. 

Thus, in a fourth aspect, the invention features a 

15 method for determining the nucleotide base sequence of a 
DNA molecule consisting of the steps of incubating a DNA 
molecule annealed with a primer molecule able to 
hybridize to the DNA molecule in a vessel containing a 
thermostable DNA polymerase, a dye terminator with a 

20 linker of at least 10 atoms between the dye and the 

nucleotide and separating DNA products of the incubating 
reaction according to size whereby at least a part of 
the nucleotide base sequence of the DNA molecule can be 
determined. 

25 In preferred embodiments, the dye terminator is a 

compound of formula I, II, III, IV or V; the 
thermostable DNA polymerase has an altered dNMP binding 
site. 
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Other features and advantages of the invention will 

be apparent from the following description of the 

preferred embodiments thereof, and from the claims. 
All articles, publications and patents cited in 

this application are hereby incorporated by reference, 

in their entirety, 

BRIEF DESCRIPTION OF THE FIGURES 
Fig. 1 presents DNA sequence data generated using 
M13mpl8 containing a 115 bp SauAI fragment from lambda 
inserted a the BamHI site and Cy5.5 ddGTP, ddATP, ddTTP, 
and ddCTP dye terminators. 

Fig. 2 is a graph of band intensity variability 
(rms) vs linker length (atoms) . 

DESCRIPTION OF THE PREFERRED EMBODIMENTS 
The following Examples are provided for further 
illustrating various aspects and embodiments of the 
present invention and are in no way intended to be 
limiting of the scope. 

Example 1: Synthesis of dideoxy dye terminators 

Cv 5.5 dideoxy nucleoside triphosphates 
Dye terminators labeled with Cy5.5 were prepared 
from propargylaminodideoxynucleotids (Prober, J.M., 
Trainor, G.L., Dam, R.J., Hobbs, F.W., Robertson, C.W., 
Zagursky, R.J., Cocuzza, A.J., Jensen, M.A. and 
Baumeister, K. , Science 238:336-41 (1987); U.S. Patent 
Nos. 5,242,796, 5,306,618, and 5,332,666) and "CyDye 
Fluorolink Cy5 . 5 mono reactive dye" product PA25501 
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(Amersham Life Science) to produce compounds II/ III, 
IV, and V. In the case of ddG and ddA, the 
propargylaminonucleotide was directly reacted with the 
N-hydroxysuccinimidyl ester of the Cy5.5 dye. In the 
5 case of ddC and ddT, a longer linker was constructed by 
reacting the propargylaminonucleotide with the N- 
hydroxysuccinimidyl ester of N-trif luoroacetyl-6- 
aminocaproic acid followed by hydrolysis in aqueous 
ammonia of the trif luoroacetyl group. The resulting 

10 compound was then reacted with the N-hydroxysuccinimidyl 
ester of the Cy5.5 dye to give the 17 -atom linker 
between the Cy 5.5 dye and the pyrimidine base . 

In addition to Cy 5.5 dyes, those who practice the 
art would know how to identify and utilize other dyes, 

15 including other cyanine dyes, with the appropriate 
optical properties. Also, the construction and 
attachment of various linkers is well known in the art. 
Suitable reagents for linker construction include one or 
more compounds consisting of activated forms of amino- 

2 0 protected alkyl or aryl amino acids such as compounds of 

the formula R-NH- (CH 2 ) n ~C0 2 R' or R-NH- (CH 2 ) n X(CH 2 ) * m -C0 2 R' , 
where R is an acid- or base-labile protecting group, R' 
is a reactive ester or anhydride group, X is aryl, 0, S, 
or NH, and where n and m are 0-12. Other linkers 
25 constructed by N- or O- or S- alkylation are also 
suitable. The exact linker length, of at least 10 
atoms, for a specific dye and dideoxynucleotide 
combination can be determined empirically by monitoring 
band uniformity in DNA sequencing as described (see 

3 0 Example 3) . 

r-i t~\ nr\ I a -\ r> *-i ^ -i 
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Example 2 : Dve terminator cycle sequencing 

DNA cycle sequencing was carried out using Thermo 
Sequenase™ DNA polymerase (Amersham, Cleveland, OH) and 
Cy5.5 dideoxy dye terminators using the following cycle 
sequencing protocol : 

1. A master mix was prepared consisting of the 
following: 



Template DNA 5 . o/zl 

10X Reaction buffer (see below) 3.5^1 

Primer, 2(M 1.0/j.l 

Polymerase (see below) 2/x 

H 2° 15 . Spl 

Total volume 27.0^1 



10X Reaction Buffer: 
150 mM Tris HCL pH 9.5 
35mM MgCl 2 

Polymerase: Thermo Sequenase™ DNA polymerase, 
lOU/^1, 0.0017U/a*1, Thermoplasma acidophilus! inorganic 
pyrophosphatase: 20mM Tris-HCl, pH 8.5, lmM DTT, 0 . ImM 
EDTA, 0.5% Tween-20, 0.5% Nonidet P-40 and 50% glycerol. 
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2 . Four microcentrifuge tubes were labeled and 2 yl of 
Cy5.5 labeled ddG, ddA, ddT, ddC solution was added to 
each tube. 

25:1 ddG Mix, 300 (M each of dGTP, dATP, dTTP & dCTP, 
5 12/iM Cy5.5 ddGTP 

25:1 ddA Mix, 300 (M each of dGTP, dATP, dTTP & dCTP, 
12/iM Cy5.5 ddATP 

25:1 ddT Mix, 3 00 /jM each of dGTP, dATP, dTTP & dCTP, 
12 (M Cy5.5 ddTTP 
10 25:1 ddC Mix, 300 juM each of dGTP, dATP, dTTP & dCTP, 
12/zM Cy5.5 ddCTP 

3. Six jul of the master mix (from step 1) was 
aliquoted to each of the 4 tubes from step 2 above. 
Cycling was carried out as follows: 95°C (30 sec), 45- 

15 55°C (30 sec) and 72°C (60 sec) for 35 cycles then 
incubate at 72 °C 5-7 minutes. 

4 . One jUl of 8M ammonium acetate was added to each 
tube. Theii 27 yCl (approximately 3 times the reaction 
volume) of chilled 100% ethanol was added. Then mixture 

20 was mixed and placed on ice for 20 minutes to 
precipitate the DNA. 

5. The mixture was centrifuged in a microcentrifuge 
(-12, OOOrpm) for 20-30 minutes at either room 
temperature or 4°C. The supernatant was removed and 

25 then 200 yX of 70% ethanol was added to wash the DNA 
pellet . 

o C 1 on / a -> o s r- .-/it 
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6. The mixture was again centrifuged for 5 minutes, 
the supernatant removed and the pellet dried (in a 
vacuum centrifuge) for 2-3 minutes. 

7. Each pellet was resuspended in 6 jtl of formamide 
loading dye (Amersham, Cleveland, OH) , vortexed 
vigorously (10-20 sec) to ensure that all DNA was 
dissolved. The mixture was briefly centrifuged to 
collect the sample at the bottom of the tube. 

8. Samples were heated to 70°C for 2-3 minutes to 
denature the DNA, then placed on ice. 

9. Then 1.5-2 pi of the volume was loaded onto a lane 
of the sequencing gel, and the gel run on the MICRO Gene 
Blaster instrument (VGI) . 

For this sequence, the template DNA was M13mpl8 
containing a 115 bp Sau3AI fragment from bacteriophage 
lambda inserted at the BamHI site (product number US 
70171 Amersham) . The primer is the -40 Forward 23-mer 
universal primer (5 • -GTTTTCCCAGTCACGACGTTGTA- 3 ' ) (SEQ. 
ID. NO. 1) . Results are shown in Figure 1. 

Example 3: Correlat ion of linker length and hanti 

intensity variability 
Sequencing reactions were carried out as described 
in example 2 with various dye molecules linked to 
dideoxynucleotides with linkers of various lengths (see 
Table 1) . The labeled DNA products were then separated 

SSSD/43876. vOl 



20 225/219 

on denaturing polyacrylamide gels and the labeled 
products were detected by fluorescence. The intensity 
of the bands is taken as the height of the peaks in a 
graph of fluorescence (in arbitrary units) against time. 
5 Typically, systematic variations in peak heights can be 
seen in graphs of peak heights plotted sequentially. 
These systematic variations in the peak heights can be 
modeled by least-square fitting to a second-order 
polynominal function. Dividing the peak height for each 
10 band by the value of the curve-fit polynomial function 
yields a normalized band intensity for each peak. 
Variation in these band intensities can be expressed as 

the square root of the variance ^ (nSx 2 - (Sx) 2 /n 2 } of the 

normalized peak heights, which can typically have values 
15 between 0 and 1 with more variability represented by 

higher numbers (Fuller, C.W., Comments 16(3) :l-8, 1989). 
This value is numerically equal to root -mean- square 
(RMS) value when 1.0 is subtracted from the normalized 
peak heights. These values are reported in Table 1 and 
20 graphed in Fig. 2. Variability of band intensities is 
significantly reduced when linkers of 10 or more atoms 
in length were used, resulting in sequence data that was 
easier to interpret accurately. 



Table 1 





Base 


Dye a 


Linker 
Length 15 


Band Uniformity 
(rms) 


1 


T 


Coumarin 


5 C 


0.32 


2 


G 


Lissamine 


5 d 


0 .77 
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R110 



0.34 



R6G 



0.32 



R6G 



0.57 



ROX 



0.36 



TMR 



0.47 



TxR 



5 d 



0.61 



Eos in 



0.40 



10 



Cy3 



10 1 



0.24 



11 



Cy5 



10 1 



0.15 



12 



Cy5 



10* 



0.21 



13 



Cy5.5 



10 1 



0.21 



14 



Cy5.5 



10* 



0.20 



15 



Fl 



12 f 



0.16 



16 



Fl 



12 £ 



0.20 



17 



Fl 



12 f 



0.17 



18 



Fl 



12 f 



0.18 



19 



R6G 



12 £ 



0.13 



20 



R6G 



12 f 



0.25 



21 



ROX 



12 f 



0.21 



22 



ROX 



12 f 



0.16 



23 



TMR 



12 f 



0.26 



24 



TMR 



12 £ 



0.29 



25 



TMR 



12 f 



0.37 



26 



TxR 



169 



0.32 



27 



TxR 



16^ 



0.24 



28 



TxR 



169 



0.22 



29 



30 



U 



TxR 



Cy3-Cy5 



169 



17* 



0.24 



0.11 



31 



Cy3-Cy5 



173 



0.16 



32 



Cy3-Cy5 



17* 



0.22 
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33 


T 


Cy3-Cy5 


17* 


0.11 


34 


C 


Cy5 


17* 


0 . 14 


35 


T 


Cy5 


17* 


0 10 


36 


C 


Cy5.5 


17* 


n on 
u . z u 


37 


T 


Cy5.5 


17* 


0 . 18 


38 


A 


Fl 


17 h 


0.16 


39 


C 


Fl 


17 h 


0.24 


40 


G 


Fl 


17 h 


0.18 


41 


T 


Fl 


17 n 


0.25 


42 


T 


Fl 


24* 


0.24 



a Abbreviations for dyes: Fl, Carboxyf luorescein; Rlio, Rhodamine 
110; R6G, Rhodamine 6G; ROX, Rhodamine X; TMR, tetramethylrhodamine; 
TXR, Texas Red (Molecular Probes). The dyes Cy3, Cy3.5, Cy5 and 
Cy5.5 were from Amersham Life Science, Cleveland, OH. 



b Linker length is the number of atoms between the ring structure of 
the nucleoside base (A, C, G or T) and the ring structure of the 
dye. 

Linker structures 

c -C=C-CH 2 -NH-C0- 

d -C=C-CH 2 -NH-S0 2 - 

e -ChC-CH 2 -NH-CS-NH- 

f -C=C-CH 2 -NH-CO-(CH 2 ) 5 -NH-CO- 

9 -ChC-CH 2 -NH-CO-(CH 2 ) 9 -NH-S0 2 - 

h -C=C-CH 2 -NH-CO-(CH 2 ) 10 -NH-CO- 

1 -ChC-CH 2 -NH-CO-(CH 2 ) 5 - 

* -C^C-CH 2 -NH-CO- (CH 2 ) 5 -NH-C0- (CH 2 ) 5 - 

k -C=C-CH 2 -NH-CO- (CH 2 ) 5 -NH-C0- (CH 2 ) 10 -NH-CO- 
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(1) GENERAL INFORMATION: 



(I) APPLICANT: 



Kumar, Shiv 
Nampalli, Stayam 
McArdle, Bernard F. 
Fuller, Carl w. 



(ii) TITLE OF INVENTION: 



DIDEOXY DYE TERMINATORS 



(iii) NUMBER OF SEQUENCES : 



(iv) CORRESPONDENCE ADDRESS 



(A) 
(B) 

(O 
(D) 
(E) 
(F) 



ADDRESSEE : 
STREET : 

CITY: 
STATE : 
COUNTRY: 
ZIP: 



Lyon & Lyon 

633 West Fifth Street 

Suite 4700 

Los Angeles 

California 

U.S.A. 

90071-2066 



(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: 

(B) COMPUTER: 

(C) OPERATING SYSTEM: 

(D) SOFTWARE : 



3.5" Diskette, 1.44 Mb 
storage 

IBM Compatible 
IBM P.C. DOS 5.0 
FastSEQ for Windows 2 . 0 



(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 



To Be Assigned 
Herewith 



(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME : 

(B) REGISTRATION NUMBER: 

(C) REFERENCE / DOCKET NUMBER: 



Warburg, Richard J. 

32,327 
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(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (213) 489-1600 

(B) TELEFAX: (213) 955-0440 

(C) TELEX: 67-3510 



(2) INFORMATION FOR SEQ ID NO: 1: 



(I) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 
GTTTTCCCAG TCACGACGTT GTA 23 



Other embodiments are within the following claims. 



QCCn/ylTOTC trill 
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CLAIMS 

1. A kit for DNA sequencing comprising: 

a first, second, third and fourth dye terminator 
molecule, each of the dye terminator molecules comprising 
5 a dye molecule, a linker of at least 10 atoms in length 
and either ddATP, ddCTP, ddGTP or ddTTP as a mono or tri- 
phosphate and a thermostable DNA polymerase. 

2. The kit of claim 1, wherein said polymerase is 
a thermostable DNA polymerase that has an altered dNMP 

10 binding site so as to improve the incorporation of 
dideoxynucleotides relative to the natural polymerase. 

3. A compound of formula (I) : 

A 

B 
C 

15 wherein A is a cyanine dye of the structure 




and the curved lines represent carbon atoms necessary for 
the formulation of cyanine dyes, X and Y are selected from 
the group consisting of O, S, and 
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CH 3 -C-CH 3 , m is an integer selected from the group 
consisting of l, 2, 3, and 4, Rl, R2, R3, R4, R5, R6 and 
R7 are independently selected from the group consisting of 
H, OH, C0 2 H, sulfonic acid or sulfonate groups, esters, 
5 amides, ethers, alkyl or aryl groups and B, and one Rl, 
R2, R3, R4, R5, R6 or R7 is B ; 

B is a linker of at least 10 atoms in length wherein 
the atoms are selected from the group consisting of 
carbon, nitrogen, oxygen, substituted carbon, and sulfur 
10 and the linker is attached at one end to A and at the 
other end to C; and 

C is a dideoxynucleotide selected from the group 
consisting of 




wherein said linker is covalently bonded to said 
15 dideoxynucleotide at position 7 for ddA and ddG and at 
position 5 for ddC and ddT and wherein r is a mono or tri- 
phosphate . 
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4. The compound of claims 3, wherein said linker is 

selected from the group consisting of 

-Chc-ch 2 -nh-co- (ch 2 ) s -nh-co- , 
-c=c-ch 2 -nh-co- (ch 2 ) 9 -nh-s0 2 - , 

5 -CsC-CH 2 -NH-CO- (CH 2 ) 10 -NH-CO- , 
-C=C-CH 3 -NH-CO- (CH 2 ) s - , 

-C=C-CH 2 -NH-CO- (CH 3 ) s -NH-CO- {CH 3 ) S -, and 
-ChC-CH 2 -NH-CO- (CH 2 ) 5-NH-CO- (CH 2 ) 10 -HH-CO- . 

5. A compound of the formula (II) : 




28 

A compound of the formula (III) : 



225/219 




29 

7. A compound of the formula (IV) : 
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8 . A compound of the formula (V) : 
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9. A deoxyribonucleic acid sequence containing the 
5 compound of formula I. 

10. A deoxyribonucleic acid sequence containing the 
compound of formula II, III, IV, or V. 

11. A kit for DNA sequencing comprising compounds of 
10 formula II, III, IV, and V. 
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12. The kit of claim 11, further comprising a 
thermostable DNA polymerase. 

13. The kit of claim 12, wherein said polymerase is 
a thermostable DNA polymerase that has an altered dNMP 
binding site so as to improve the incorporation of 
dideoxynucleotides relative to the natural polymerase. 

14. Method for determining the nucleotide base 
sequence of a DNA molecule comprising the steps of: 

incubating a DNA molecule annealed with a primer 
molecule able to hybridize to said DNA molecule in a 
vessel containing a thermostable DNA polymerase, one 
of a set of four dye terminators with an linker of at 
least 10 atoms between the dye and the nucleotide and 

separating DNA products of the incubating 
reaction according to size whereby at least a part of 
the nucleotide base sequence of said DNA molecule can 
be determined. 

15. Method for determining the nucleotide base 
sequence of a DNA molecule comprising the steps of: 

incubating a DNA molecule annealed with a primer 
molecule able to hybridize to said DNA molecule in a 
vessel containing a thermostable DNA polymerase, a 
compound of formula I and 

separating DNA products of the incubating 
reaction according to size whereby at least a part of 
the nucleotide base sequence of said DNA molecule can 
be determined. 
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16. Method for determining the nucleotide base 
sequence of a DNA molecule comprising the steps of: 

incubating a DNA molecule annealed with a primer 
molecule able to hybridize to said DNA molecule . in a 
vessel containing a thermostable DNA, a compound of 
formula II, III, IV, or V and 

separating DNA products of the incubating 
reaction according to size whereby at least a part of 
the nucleotide base sequence of said DNA molecule can 
be determined. 

17. The method of any of claims 14, 15, or 16 
wherein said polymerase is a thermostable DNA polymerase 
that has an altered dNMP binding site so as to improve the 
incorporation of dideoxynucleo tides relative to the 
natural polymerase. 
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ABSTRACT 

A kit for DNA sequencing comprising a first, second, 
third and fourth dye terminator molecules, each of the dye 
terminator molecules comprising a dye molecule, a linker 
of at least 10 atoms in length and either ddATP, ddCTP, 
ddGTP or ddTTP as a mono or tri -phosphate and a 
thermostable DNA polymerase. 




FIGURE 2 
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COMBINED DECLARATION AND POWER OF ATTORNEY 

As a below named inventor, I hereby declare that; 

My residence, post office address and citizenship are as stated below next to my name. 

I believe I am the original, first and sole inventor (if only one name is listed below) or an original, first and joint inventor 
(if plural names are listed below) of the subject matter which is claimed and for which a patent is sought on the invention entitled 

DIDEOXY DYE TERMINATORS the specification of which 

is attached hereto. 

X was filed on February 4. 1998 as Application Serial No. 09/018,695 and was amended on . 



I hereby state that I have reviewed and understand the contents of the above-identified specification, including the claims, 
as amended by any amendment referred to above. 

I acknowledge the duty to disclose information which is material to the patentability and/or examination of this application 
in accordance with Title 37, Code of Federal Regulations, § 1.56(a). 

I hereby claim foreign priority benefits under Title 35, United States Code, §1 19 of any foreign application(s) for patent or 
inventor's certificate listed below and have also identified below any foreign application for patent or inventor's certificate having 
a filing date before that of the application on which priority is claimed: 
Prior Appiication(s): 



(Number) 


(Country) 


(Day/Month/Year Filed) 


Yes 


No 


(Number) 


(Country) 


(Day/Month/Year Filed) 


Yes 


No 


(Number) 


(Country) 


(Day/Month/Year Filed) 


Yes 


No 



I hereby claim the benefit under Title 35, United States Code, § 1 19(e) of any United States provisional application(s) listed 

below. 



(Application Number) (Filing Date) 

I hereby claim the benefit under Title 35, United States Code, §120 of any United States application(s) listed below and, 
insofar as the subject matter of each of the claims of this application is not disclosed in the prior United States application in the 
manner provided by the first paragraph of Title 35, United States Code, §112, I acknowledge the duty to disclose material 
information as defined in Title 37, Code of Federal Regulations, § 1.56(a) which occurred between the filing date of the prior 
application and the national or PCT international filing date of this application: 



(Application Serial No.) (Filing Date) (Status) 



(Application Serial No.) (Filing Date) (Status) 
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I hereby appoint the following attorney(s) and/or agent(s) to prosecute this application and to transact all business in the 
Patent and Trademark Office connected therewith: Richard J. Warburg, Esq., Registration No. 32.327 



Kindly recognize as associate attorney: 

Roland N. Smoot, Reg. No. 18,718; Conrad R. Solum, Jr. Reg. No. 20,467; James W. Geriak, Reg. No. 20,233; 
Robert M. Taylor, Jr., Reg. No. 19,848; Samuel B. Stone, Reg. No, 19,297; Douglas E. Olson, Reg. No. 22,798; 
Robert E. Lyon, Reg. No. 24,171; Robert C. Weiss, Reg. No. 24,939; William E. Thomson, Jr., Reg. No. 29,719; 
Richard E. Lyon, Jr., Reg. No. 26,300; John D. McConaghy, Reg. No. 26,773; William C. Steffin, Reg. No, 26,81 1; 
Coe A. Bloomberg, Reg. No. 26,605; J. Donald McCarthy, Reg. No. 25,1 19; John M. Benassi, Reg. No. 27,483; 
James H. Shalek, Reg. No. 29,749; Allan W. Jansen, Reg. No. 29,035; Robert W. Dickerson, Reg. No. 29,914; Roy 
L. Anderson, Reg. No. 30,240; David B. Murphy, Reg. No. 31,125; James C. Brooks, Reg. No. 29,898; Jeffrey M. 
Olson, Reg. No. 30,790; Steven D. Hemminger, Reg. No. 30,755; Jerrold B. Reilly, Reg. No. 32,293; Paul H. Meier, 
Reg. No. 32,274; John A. Rafter, Jr., Reg. No. 31,653; Kenneth H. Ohriner, Reg. No. 31,646; Mary S. Consalvi, 
Reg. No. 32,212; Bradford J. Duft, Reg. No. 32,219; Suzanne L. Biggs, Reg. No. 30,158; F.T. Alexandra Mahaney, 
Reg. No. 37,668; Sheldon O. Heber, Reg. No. 38,179; Jeffrey W. Guise, Reg. No. 34,613; Charles S. Berkman, Reg. 
No. 38,077; Anthony C Chen, Reg. No. 38,673; and Wesley B. Ames, Reg. No. 40,893 of LYON & LYON, 633 
West Fifth Street, Suite 4700, Los Angeles, California 90071-2066 

Address all telephone calls to Richard J. Warburg. Esq. at telephone no. 619 552-8400 . 

Address all correspondence to Richard J. Warburg. Esq.. LYON & LYON LLP. 633 West Fifth Street, Suite4700. Los Angeles 
CA 90071 . 

I hereby declare that all statements made herein of my own knowledge are true and that all statements made on information 
and belief are believed to be true; and further that these statements were made with the knowledge that willful false statements and 
the like so made are punishable by fine or imprisonment, or both, under Section 1001 of Title 18 of the United States Code and that 
such willful false statements may jeopardize the validity of the application or any patents issuing thereon. 
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